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ABSTRACT 

Spectroscopic+photometric redshifts, stellar mass estimates, and rest-frame colors from the 3D-HST 
survey are combined with structural parameter measurements from CANDELS imaging to determine 
the galaxy size- mass distribution over the redshift range 0 < z < 3. Separating early- and late- type 
galaxies on the basis of star-formation activity, we confirm that early-type galaxies are on average 
smaller than late-type galaxies at all redshifts, and find a significantly different rate of average size 
evolution at fixed galaxy mass, with fast evolution for the early-type population, R e g oc (1 + z) -1 ' 48 , 
and moderate evolution for the late-type population, R e s oc (1 + z)~°' 75 . The large sample size 
and dynamic range in both galaxy mass and redshift, in combination with the high fidelity of our 
measurements due to the extensive use of spectroscopic data, not only fortify previous results, but 
also enable us to probe beyond simple average galaxy size measurements. At all redshifts the slope of 
the size-mass relation is shallow, R e g oc M, 2 , for late-type galaxies with stellar mass > 3 x 10 9 M 0 , 
and steep, R e g oc Mj?' 75 , for early- type galaxies with stellar mass > 2 x 10 10 M 0 . The intrinsic scatter 
is <0.2 dex for all galaxy types and redshifts. For late-type galaxies, the logarithmic size distribution 
is not symmetric, but skewed toward small sizes: at all redshifts and masses a tail of small late- type 
galaxies exists that overlaps in size with the early-type galaxy population. The number density of 
massive (~ 10 11 M 0 ), compact ( R e g < 2kpc) early-type galaxies increases from z = 3 to z = 1.5 — 2 
and then strongly decreases at later cosmic times. 

1. INTRODUCTION 

The size distribution of the stellar bodies of galaxies, 
and its evolution with cosmic time, provides important 
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clues about the assembly history of galaxies and the re- 
lationship with their dark matter halos. The two main 
classes of galaxies, early- and late types, show very dif- 
ferent dependencies between size and stellar mass (Shen 
et al. 2003). The weak dependence between size and 
mass for late-type galaxies implies that the high-mass 
late types, on average, have higher surface mass densi- 
ties than low-mass late types. In contrast, early types 
show a more complex relationship between stellar mass 
and density, with the density peaking for systems with 
stellar masses around M* ~ 4 x 10 10 M 0 and decreas- 
ing towards both lower and higher masses, as reflected in 
the classical Kormendy (1977) relation. This fundamen- 
tal difference does not depend on whether classification 
of early and late types is based on star formation activ- 
ity, bulge dominance (Sersic index) or visual inspection, 
and implies that the two types have very different evolu- 
tionary and assembly histories. 

In this paper we present the evolution of the size-mass 
distribution up to z = 3, based on spectroscopy and 
multi-wavelength photometry from the 3D-HST survey 
(Brammer et al. 2012a) and HST/WFC3 imaging from 
CANDELS (Grogin et al. 2011; Koekemoer et al. 2011). 
Angular galaxy sizes are measured from the CANDELS 
imaging as described by van der Wel et al. (2012) and 
the HST/WFC3 grism observations from 3D-HST pro- 
vide spectroscopic confirmation and redshifts for a large 
fraction of the sample, considerably strengthening, with 
respect to previous studies, the fidelity of estimates for 
stellar masses and rest-frame photometric properties. 

So far, most previous studies have focused on the evo- 
lution of average galaxy sizes of the high-mass end of 
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the distribution (> 5 x 10 10 M 0 ). Enabled by both the 
improved data quality and a five-fold increase in sam- 
ple size, we can now, for the first time, describe the size 
distribution of galaxies across redshift. 

1.1. Size Evolution of Late- Type Galaxies 

Tracing the evolution of the size distribution with red- 
shift allows us to test the most basic elements in our 
theory of galaxy formation. The zero-th order expecta- 
tion is that disk scale lengths evolve fast, approximately 
as the inverse of the Hubble parameter (Mo, Mao & 
White 1998), and early and recent work on the aver- 
age sizes of Lyman Break Galaxies (LBGs) at high red- 
shifts (z ~ 2 — 6) roughly agree with this expectation 
for a ACDM cosmology: Giavalisco et al. (1996), Fer- 
guson et al. (2004), Oesch et al. (2010), and Mosleh 
et al. (2012) all find rapid size evolution with redshift: 
R e s oc (1 + z)P=- 1A . 

In contrast, the average size at a given stellar mass of 
the population of disk-dominated galaxies evolves slowly 
at late times (z < 1) has been reported to evolve slowly 
as measured at fixed galaxy mass (/3 = —0.2) or not at 
all (Lilly et al. 1998; Ravindranath et al. 2004; Barden 
et al. 2005). The implication would be that the evo- 
lution of the disk galaxy population is decoupled from 
the evolution of the dark matter halo population. One 
fundamental difference between the results on LBGs and 
lower-redshift disk galaxies is the rest-frame wavelength 
at which the sizes are measured: the rest-frame UV light 
seen for LBGs originates from young stars that may be, 
and are generally expected to be, distributed differently 
than bulk of baryonic and stellar mass, not to mention 
the consequences of extinction. 

The advent of ground-based near-infrared imaging sur- 
veys helped to bridge the z < 1 and z > 2 regimes by 
enabling size measurements in a consistent manner at a 
fixed rest-frame wavelength. Early results suggested slow 
evolution for late- type galaxies up to z ~ 3 (Trujillo et al. 
2006a), but the uncertainties at z > 1 were such that 
evolution in that regime was not strongly constrained. 
Later ground-based work pointed at faster evolution at 
a fixed galaxy mass: Franx et al. (2008) found /3 = —0.6, 
and Williams et al. (2010) found /3 = —0.9, but precise 
constraints at z > 1.5 remained elusive and the apparent 
tension between the z < 1 work and the near-infrared at 
z ~ 1.5 unaddressed. 

Several HST/NICMOS-based studies of the morphol- 
ogy and structure of massive z ~ 2 galaxies in the rest- 
frame optical eventually led to mostly converged results, 
with /3 ~ —0.8 for massive (~ 10 11 M 0 ), star-forming 
galaxies from 2 ~ 2.5 to the present (Toft et al. 2007; 
Buitrago et al. 2008; Kriek et al. 2009b). Whether the 
difference with the significantly faster evolution for LBG 
galaxies (/3 ~ —1.1) is caused by morphological K cor- 
rections, the difference in mass (the typical LBG has 
M* ~ 10 10 M 0 ), or physical changes with redshift has so 
far remained unclear. In addition, the difference with the 
previously mentioned studies at z < 1 (Lilly et al. 1998; 
Ravindranath et al. 2004; Barden et al. 2005) remains 
unexplained. 

Improving the measurement of /3 and its mass- 
dependence is crucial to make the next step in under- 
standing disk galaxy formation. In this paper we will 


address these issues and describe the full size-mass dis- 
tribution of high-redshift galaxies over a broad range in 
galaxy mass and redshift. We will 

• measure the evolution of the slope of the size-mass 
relation; 

• present the size distribution as a function of stellar 
mass and redshift; 

• provide a consistent comparison with UV-selected, 
high-redshift samples. 

1.2. Size Evolution of Early-Type Galaxies 

Over the past five years, more attention has been be- 
stowed on the size evolution of early-type galaxies than 
on the size evolution of late-type, star-forming galaxies. 
Interest in the topic was initiated by reports that z ~ 1.5 
early-type galaxies have remarkably small sizes in HST- 
based, rest-frame UV imaging (Daddi et al. 2005; Tru- 
jillo et al. 2007) and ground-based near-infrared imaging 
(Trujillo et al. 2006b). NICMOS imaging presented by 
Zirm (2007), Toft et al. (2007), Stockton et al. (2008), 
and McGrath et al. (2008) provided space-based, rest- 
frame optical size measurements which strengthened the 
evidence for rapid size evolution (/? = —1 or faster) as 
measured at a fixed galaxy mass (~ 10 11 Mg). This 
notion became firmly established through larger samples 
(Buitrago et al. 2008) and the first spectroscopic samples 
(van Dokkum et al. 2008; Damjanov et al. 2011). 

Concerns regarding gross overestimates of the stellar 
mass content of the compact early-type galaxies were al- 
leviated by dynamical mass estimates of such galaxies 
at z > 1 (van der Wei et al. 2008; Cimatti et al. 2008; 
Newman et al. 2010; van de Sande et al. 2011; Toft et 
al. 2012; van de Sande et al. 2013; Bezanson et al. 2013; 
Belli et al. 2014), and the analysis by Szomoru et al. 
(2010) of ultra-deep imaging of a single compact galaxy 
has demonstrated the absence of low-surface brightness 
wings that could have been missed by shallower imag- 
ing. Increases in sample size and dynamic range in stel- 
lar mass have constrained the average size evolution of 
early-type galaxies with stellar masses > 5 x 10 10 M 0 to 
/ 3 ~ —1.3 up to ^ = 2.5, with no evidence for a change 
in the slope of the relation over this mass range (New- 
man et al. 2012). The steepness of the relation combined 
with the non-negligible scatter accommodates observa- 
tions that early-type galaxies display a large range in 
size at 2 > 1 (e.g.. Mancini et al. 2010; Saracco et al. 
2011). 

While the observational results have largely converged, 
the interpretation is still debated. Some authors have 
considered the average increase in size over time due to 
the addition of new, larger early-type galaxies, and while 
some argue that this cannot reproduce the observations 
(van der Wei et al. 2009a), others argue that a population 
of compact early-type galaxies (with sizes i? e ff 1$ 2kpc) 
exists within present-day clusters, with a number density 
comparable to that of higher-redshift early-type galaxies 
(Valentinuzzi et al. 2010; Poggianti et al. 2013); tension 
with the absence of such galaxies in the Sloan Digital 
Sky Survey (SDSS) remains (Trujillo et al. 2009; Taylor 
et al. 2010). The crucial observational test is to trace 
the evolution of the number density of early-type galax- 
ies as a function of their size. Cassata et al. (2011), 
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Szomoru et al. (2012), and Newman et al. (2012) show 
strong evolution in the number density of small galaxies 
at 0 < z < 2.5, while Carollo et al. (2013) claim no evo- 
lution at 0 < 2 < 1. Our use of 5 fields address the issue 
of field-to-field variations that may affect the aforemen- 
tioned studies based on smaller samples, and extend the 
dynamic redshift range of the Carollo et al. sample. 

The leading explanation for the size growth of individ- 
ual galaxies is accretion and tidal disruption of satellite 
galaxies that gradually build up the outer parts. For this 
process, the change in size is large compared to the in- 
crease in mass: A R c g oc AM* (e.g., Bezanson et al. 2009; 
Hopkins et al. 2009). This analytical prediction based on 
conservation of binding energy has been tested through 
numerical simulations (e.g., Naab et al. 2009; Oser et 
al. 2012; Bedorf & Portegies Zwart 2013). The analyti- 
cally predicted and simulated evolution in the increased 
surface mass density at large radii is, in fact, observed 
(van Dokkum et al. 2010); in addition, the central stel- 
lar density shows little evolution (Bezanson et al. 2009; 
Hopkins et al. 2009; van Dokkum et al. 2010), also con- 
sistent with a minor-merger scenario. In other words, the 
observations show that there is no need and little room 
to physically expand a galaxy by displacing large num- 
bers of stars to large radii through rapid changes in the 
central potential, as suggested by Fan et al. (2010). A 
possible challenge to the minor merger scenario is posed 
by the lack of strong evolution in the slope of the mass 
density profile seen in lensing galaxies (Sonnenfeld et al. 
2013). 

Until recently, the size evolution of late- and early-type 
galaxies was usually discussed separately and treated as 
different topics. However, in order to understand the 
joint evolution of these classes one has to take into ac- 
count the continuous transition of late- type to early-type 
galaxies seen in particular in the stellar mass range of 
10 10 M 0 to 10 11 Mq (e.g., Bell et al. 2004; Faber et al. 
2007; Brown et al. 2007; Ilbert et al. 2010; Brammer et al. 
2011; Buitrago et al. 2013; Muzzin et al. 2013). The star- 
forming progenitors of the small early-type galaxies are 
now looked for and plausibly identified (e.g., Whitaker 
et al. 2011; Barro et al. 2013,?; Toft et al. 2014), but the 
evolutionary path of the transitioning galaxies has not 
been fully mapped out. 

In this paper, regarding the evolution of early-type 
galaxies, we will 

• present the distribution of sizes as a function of 
stellar mass and redshift, jointly with those of late- 
type galaxies; 

• show the evolution of the number density of early- 
type galaxies as a function of size. 

After describing the data and sample selection in §2 
we present and analyze size distributions as a function of 
redshift and galaxy mass in §3. We compare our results 
with previous studies in §4 and then discuss the implica- 
tions of our findings in §5. We assume the cosmological 
parameters (Dm, Da, h) = (0.27, 0.73, 0.71). Finally, 
we use AB magnitudes and the Chabrier (2003) stellar 
initial mass function. 

2. DATA 


The procedures for source detection, multi- wavelength 
photometry, redshift determinations, rest-frame color 
and stellar mass estimates are described elsewhere. Here, 
we briefly summarize these steps. 

2.1. Source Detection 

207,967 sources in all five CANDELS/3D-HST fields 
(Koekemoer et al. 2011; Brammer et al. 2012a) are de- 
tected in and extracted from images that combine the 
available HST/WFC3 IR channel data; that is, stacked 
mosaics consisting of F125W, F140W and F160W imag- 
ing are constructed for this purpose. We refer to the 
photometry data release paper from the 3D-HST collab- 
oration by Skelton et al. (2014) for details. 

2.2. Photometric and Spectroscopic Redshift 
Determinations 

Multi-wavelength photometry from HST/WFC3 and 
HST/ACS imaging is produced by creating PSF-matched 
images, using custom-made kernels and performing sim- 
ple aperture photometry. Multi-wavelength photometry 
from ground-based optical-NIR and Spitzer/IRAC imag- 
ing is produced using the approach outlined by Labbe 
et al. (2006) and further developed by Gonzalez et al. 
(2010), which addresses blending by nearby sources and 
takes the large differences in PSF into account through 
the use of custom-made convolution kernels. Like alter- 
native methods such as TFIT (Laidler et al. 2007), our 
approach uses high-resolution images as priors to model 
sources in lower-resolution images. 

Photometric redshifts are determined based on the 
multi-wavelength photometry using the EAZY package 
(Brammer et al. 2008). (Skelton et al. 2014) describe 
the procedure in full, but it essentially follows Whitaker 
et al. (2011): briefly, linear combinations of a set of tem- 
plates that span the range of observed galaxy properties 
are used to fit the photometry, producing a marginal- 
ized, posterior probability distribution for the redshift, 
with Zphot as its peak. 

The photometric redshifts provide a baseline for 3D- 
HST WFC3 G141 grism spectroscopy to provide more 
precise redshift information. Brammer et al. (2012a) de- 
scribe the extraction of spectra and redshift determina- 
tion in detail. The method has been updated to use in- 
terlaced rather than drizzled HST /WFC3 mosaics, which 
are used to construct the photometric catalogs. For all 
sources brighter than FT60Wab = 23, the F140W im- 
age is traced along the dispersed WFC3/G141 grism im- 
age, such that a spectrum is extracted that accounts for 
the convolved spectral and spatial information of the low 
wavelength resolution ( R ~ 130) grism data. For each 
extracted object the spectroscopic information is com- 
bined with the photometric redshift probability distribu- 
tion, producing a new, best-fitting redshift (Momcheva 
et al. in prep.). Finally, spectroscopic redshifts from the 
literature are used when available. 

The grism data significantly improve the redshift pre- 
cision for thousands of galaxies, and provide indispens- 
able evidence for the good accuracy of the purely pho- 
tometric redshift estimates. The current version of the 
3D-HST redshift catalog contains grism redshift informa- 
tion for all objects brighter than H-pieaw = 23 and for 
which such data is available (~75% of the CANDELS 
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Figure 1. Rest-frame U — V vs. V — J color distribution for six redshift bins (each 0.5 wide). The two distinct classes of quiescent and 
star-forming galaxies are separated by the indicated selection criteria to define our early- and late-type galaxy samples. 


area). For our mass-limited sample defined below in 
Section 2.4 - this amounts to ^10,000 galaxies. This 
is ~30% of the total sample and 50% for the sample of 
massive (M* > 10 10 Mq) galaxies in the crucial red- 
shift range 1 < z < 3. For these galaxies the Quadri 
& Williams (2010) pair test demonstrates a precision of 
Az/(1 + z) = 0.003, or 0.3%. For purely photometric 
redshifts this is 1 — 2.5%, depending on the varying pho- 
tometric dataset available for each of the five fields, sug- 
gesting a factor 3—10 improvement in redshift precision 
from the grism data. There is no systematic offset be- 
tween the two sets of redshifts. 

2.3. Rest-Frame Colors and Stellar Mass Estimates 

EAZY is used to compute the rest-frame U—V and V — J 
colors and the package FAST (Kriek et al. 2009a) is used 
to estimate stellar masses. A large number of Bruzual & 
Chariot (2003) templates with solar metallicity, a wide 
range in age (4 x 10 7 yr to 12.5 x 10 9 yr, but always 
younger than the universe), exponentially declining star- 
formation histories (with time scales r = 10 7 — 10 10 yr), 
and dust extinction (Ay = 0 — 4) are used and matched 
to the photometry. The final stellar mass is corrected 
for the difference between the total F160W flux from the 
photometric catalog and the total F160W as measured 
with GALFIT (see §2.5). F125W is used in case F160W is 
not available. This correction ensures that our size and 
mass estimates are mutually consistent in that both are 
based on the same model for the light distribution. 

The uncertainties in the stellar mass estimates can be 
large and are to some extent still unknown. However, 


the possible error in mass-to-light ratio for low-mass blue 
galaxies with precisely known redshifts due to uncertain- 
ties in the SFH and metallicity is not larger than a fac- 
tor of a few. Moreover, for high-mass galaxies the rela- 
tion between dynamical and stellar mass estimates is the 
same to within a factor ~ 2 over the whole redshift range 
0 < z < 2, indicating that stellar masses are estimated 
reasonably consistently at different redshifts (e.g., van 
der Wei et al. 2006; van de Sande et al. 2013). The di- 
rect mass measurement of a z ~ 1.5 strong gravitational 
lens is also in agreement with the photometrically esti- 
mated stellar mass (van der Wei et al. 2013). For these 
reasons we are confident that the analysis presented in 
this paper, which looks at galaxies that span several or- 
ders of magnitude in stellar mass, is not fundamentally 
dependent on the current uncertainties in the mass esti- 
mates. 

2.4. Sample Selection 

Following Wuyts et al. (2007) and Williams et al. 
(2009), we utilize rest- frame colors to distinguish between 
the two basic classes of galaxies: star-forming and qui- 
escent galaxies. In this paper we refer to the former as 
late-type galaxies, and to the latter as early-type galax- 
ies. In Figure 1 we show the rest-frame U — V and V — J 
color distribution of galaxies with stellar mass in excess 
of 10 10 Mq for a range of redshifts. Even beyond z = 2 
the early- and late-type galaxies are separated in this 
space, which allows us to effectively assign each object 
to one of the two classes. 

An indication of the high fidelity of this selection 
method is that less than 20% of the thus classified early- 
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Figure 2. Observed #F 160 W magnitude versus stellar mass in six redshift bins. The color-coding represents the rest- frame U — V colors, 
ranging from U — V = 0 (blue) to U — V = 2 (red). The horizontal dashed lines indicate the limit (#F 160 W — 24.5) down to which we 
can determine sizes with good fidelity. The vertical lines illustrate the resulting mass completeness limits for blue (U — V = 0.5) and red 
{U — V = 2.0) galaxies, respectively. See §2.4 for further details. 


type galaxies with matches in the FIREWORKS catalog 
from Wuyts et al. (2007) are detected at 24/zm, while 
more than ~ 80% of the late-type galaxies are detected 

- these numbers are for the mass range 3 x 10 10 Mq < 
Mq < 10 11 M 0 and the redshift range 1 < z < 2. We 
note that a simpler selection by just U — V color would 
compromise the separation into types, as the sub-sample 
of galaxies that are red in U — V consist of dusty and 
quiescent objects. In addition, 80% of the color-selected 
early-type (late-type) galaxies have Sersic indices large 
(smaller) than n = 2.5. 

We adopt redshift- and color-dependent mass limits 
(Figure 2) that are set by the F160W magnitude limit 
down to which galaxy sizes here described in §2.5 

- can be determined with high fidelity, van der Wei 
et al. (2012) showed that galaxy sizes can be precisely 
and accurately determined down to a magnitude limit of 
-^Fi 60 W = 24.5; at fainter magnitudes the random and 
systematic errors can exceed 20% for large galaxies with 
large Sersic indices. Since most z > 1 galaxies in our 
sample have small sizes (0.3” in the median) and low 
Sersic indices (1.4 in the median) this magnitude limit is 
conservative. 

Out to z = 3 our magnitude-limited sample is com- 
plete in stellar mass down to ~ 10 10 Mq (see Figure 2). 
This limit is 1.9 magnitudes brighter than the 5er detec- 
tion limit, and simulations of artificial objects inserted in 
the images show that > 95% of all objects are detected 
(Skelton et al. 2014). Therefore, incompleteness will not 
be an issue for our sample, and biases against large (or 


small) galaxies will not play a significant role. 

Our mass-selected sample, with a redshift-dependent 
mass limit as described above (and a minimum of M* = 
10 9 Mq and Hfi 60 W = 25.5 at all redshifts), con- 
tains 32,722 galaxies. 43 of these are excluded be- 
cause of catastrophically failed surface brightness pro- 
file fits. We then manually verified the SEDs, size mea- 
surements, mass and redshift estimates of all 7,065 ob- 
jects with flagged GALFIT results (/ = 2 from van der 
Wei et al. 2012), small or large sizes (R e g < 0.6 kpc; 
R e g > 10 kpc), large stellar masses (M*/M 0 > 10 10 - 8 ), 
or large differences between photometric and grism red- 
shifts (A z > 0.15), as well as all early- type galaxies at 
redshifts 2 < z < 3. 1,721 (5.3%) were removed af- 
ter this verification due to errors in the size, redshift, or 
mass measurements. Hence, we have a final sample of 
30,958 galaxies; 21,828 (5,189) are at z > 1 (z > 2). 

In Figure 3 we show the rest-frame (U — V)-M* dis- 
tribution - a clear bi-modality is seen, equivalent to the 
bi-modality seen in Figure 1. As is well-known, more 
massive galaxies are redder and more likely to be quies- 
cent, at least up to z ~ 2.5. The most massive galaxies 
> 10 11 Mq are essentially all red. They are a mix of qui- 
escent and star-forming galaxies; the quiescent galaxies 
dominate in number at 2 < 1; the star- forming galaxies 
at z > 2. 

2.5. Size Determinations 

Galaxy sizes are measured as described by van der Wei 
et al. (2012) from mosaics created by the 3D-HST col- 


6 


Size-Mass Relation from CANDELS/3D-HST 



10 '° 10 11 10 10 10 " 10 10 10 " 


Stellar Mass (M 0 ) 


Figure 3. Rest-frame U — V color vs. stellar mass in six redshift bins. Early-type galaxies, defined as illustrated in Figure 1, are shown in 
red; late types in blue. A clearly defined red sequence is seen up to z = 3, with an increased prevalence of dusty late-type galaxies toward 
higher redshifts. 


laboration from public CANDELS F125W and F160W 
and 3D-HST F140W raw imaging data. With GALFIT we 
fit single-component Sersic profiles to the 2-dimensional 
light profiles of all detected objects, making use of 
custom-made PSF models; the package GALAPAGOS allows 
for simultaneous fitting of as many neighboring objects 
as needed, in order to avoid confusion. As the effec- 
tive radius we use the semi-major axis of the ellipse that 
contains half of the total flux of the best-fitting Sersic 
model. For a full description of the procedure we refer 
to van der Wei et al. (2012); the size measurements pub- 
lished in their catalog and the size measurements here 18 
are fully consistent. 

Color gradients and their evolution affect the size mea- 
surements to a degree that greatly exceeds the statistical 
uncertainties in our sample averages and the size distri- 
butions as a function of galaxy mass and redshift. The 
dynamic range in wavelength for our sample is rather 
small (from 1.25pm to 1.6pm) which does not allow to 
systematically probe the effect of color gradients over the 
large range in redshift. To extend the dynamic range we 
analyze ACS/F814W imaging in the COSMOS field from 
CANDELS parallel observations. GALAPAGOS is used to 
obtain galaxy sizes in precisely the same manner for the 
F814W data as for the WFC3 data. 

Using this extension in wavelength we show in Fig- 


18 The results from our GALFIT profile fits used here, with IDs 
matched to the Skelton et al. (2014) catalogs, are publicly available 
for all 5 fields: 

http://www.mpia-hd.mpg.de/homes/vdwel/3dhstcandels.html. 


ure 4 the wavelength dependence of galaxy size for a 
sample of 777 late-type galaxies. The figure shows 
A log R e g / A log A as a function of galaxy mass and red- 
shift. For galaxies at z < 1, the difference between 
log R e g from the F814W and F125W size measurements 
was taken as Alogi? e fj; for z > 1 galaxies the difference 
between logi? e g from the F125W and F160W measure- 
ments were taken. The pivot wavelengths of the respec- 
tive filters are used to compute A log A. 

The generally negative values of the quantity 
A log R e g / A log A imply that late- type galaxies are typ- 
ically smaller at longer wavelengths and thus have neg- 
ative color gradients (redder centers). The color gra- 
dients of z > 1 galaxies have been extensively studied 
before (e.g., Szomoru et al. 2011; Wuyts et al. 2012), 
and here we merely mention the trends that are rele- 
vant for our conversion of the measured R e g to the R e g 
at a rest-frame wavelength of 5000A. Color gradients 
are strongest for the most massive galaxies at all red- 
shifts, and stronger at later cosmic times for galaxies of 
all masses. The evolution is remarkably smooth, which 
allows us to parametrize the wavelength-dependence of 
R e g as a simple function of redshift and galaxy stellar 
mass: 


A log R eS 
A log A 


-0.35 + 0.12,-0.251og( I5 ^ J -). (1) 


Kelvin et al. (2012) derive a very similar wavelength 
dependence of R e g, but a direct comparison cannot be 
made as those authors do not distinguish between galax- 
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Figure 4. Wavelength dependence of R e ff in bins of stellar mass 
and redshift; the latter is indicated by the color coding. Late- type 
galaxies, as defined in Figure 1, with robust size measurements 
from ACS/F814W, WFC3/F125W and WFC3/F160W imaging are 
included (see text for details). Generally, sizes are smaller at longer 
wavelengths, that is, late- type galaxies are bluer in the outer parts. 
Moreover, this gradient is stronger for more massive galaxies at all 
redshifts, and the gradient decreases with redshift, at the same 
rate for all masses. The dotted lines represent the parametrization 
given in Eq. 1 that we use to correct our size measurements of 
late- type galaxies. 

ies with different masses. Then, R e ff at a rest-frame 
wavelength of 5000A is estimated as 

, A lQ g R »ff 

R,« = • ( 2 ) 

\ i + z p / 

where F denotes either F125W (for galaxies at z < 1.5) 
or F160W (for galaxies at z > 1.5), and z p is the ‘pivot 
redshift’ for these respective filters (1.5 for F125W and 
2.2 for F160W). Positive color gradients as computed 
with Equation 1 are set at zero. 

A similarly detailed correction for early-type galaxies is 
currently not feasible as that sub-sample is much smaller 
and has much redder colors. For the 122 early-type 
galaxies in the COSMOS field at redshifts 0 < z < 2 and 
with robust size measurements in all three filters, we find 
an average size gradient of A log R e g / A log A = —0.25, 
with no discernible trends with mass and redshift, in 
reasonable agreement with Kelvin et al. (2012) for low- 
redshift and Guo et al. (2011) for high-redshift early-type 
galaxies. Hence, we adopt this value for all early-type 
galaxies when computing their rest-frame 5000A sizes 
with Equation 2. 

This rather convoluted procedure has a small but sig- 
nificant effect on the size measurements, with implica- 
tions for the rate of size evolution. In the redshift range 
1 < z < 2 the size corrections are always smaller than 
0.05 dex, because the observed wavelength is always close 
to the desired rest-frame wavelength; at z < 1 the cor- 
rection can increase up to 0.15 dex, but there the color 
gradient is well-constrained (see Figure 4) for all galaxy 


masses. 

3. EVOLUTION OF THE SIZE-MASS DISTRIBUTION AT 
0 < Z <3 

The size-mass distributions for early- and late-type 
galaxies as a function of redshift are shown in Figure 
5. The first basic observation is that a well-defined size- 
mass relation exists at all redshifts 0 < z < 3 for both 
populations. With increasing redshift, the two classes 
have increasingly different size-mass relations, mostly as 
a result of strongly decreasing R e g for early types. In ad- 
dition, at all redshifts the slope of the relation is steeper 
for early types than for late types. At all stellar masses 
< 10 11 Mq early-type galaxies are, on average, smaller 
than late-type galaxies, but at very high stellar masses 
(> 2 x 10 11 Mq) the two populations have similar sizes 
at all redshifts, as far as the small samples allow for such 
a comparison. 

However, the relation for early-type galaxies flattens 
out below M* = 2 x 10 10 M 0 , at least up to z = 1.5 be- 
yond which our sample is incomplete at these low masses. 
This implies that the peak in surface mass density at 
~ 4 x 10 10 Mq seen in the present-day universe also ex- 
isted at larger look-back times, at least up to z ~ 2. 

In §3.1 we will first provide an analytical description 
of the size-mass relation, which allows us to take cross- 
contamination between the two types and outliers into 
account. In the remainder of this section we will provide 
various direct measurements such as median sizes and 
percentile distributions as a function of mass and red- 
shift, and describe trends that are not captured by our 
analytical description, such as deviations from a single 
power law and skewness of the size distribution. 

3.1. Analytical Description 

The basic characteristics of the galaxy size distribution 
are given by the slope, intercept and (intrinsic) scatter of 
size as a function of mass. We parametrize this following 
Shen et al. (2003) and assume a log-normal distribution 
A(logr, criogr), where logr is the mean and <Ti ogr is the 
dispersion. Furthermore, r is taken to be a function of 
galaxy mass: 

r(ro*)/kpc = A-m“, (3) 

where m* = M*/ 7 x 10 10 Mq. As we will describe in 
§3.3 it is reasonable to assume that <Ti ogr is independent 
of mass. 

The model distribution V(logr(m*), ai ogr ) prescribes 
the probability distribution for observing i? e ff for a 
galaxy with mass m*. If the measured i? e ff has a Gaus- 
sian, l-er uncertainty of <5 log R e s, then the probability for 
this observation is the inner product of two Gaussians: 

p = {N (log R eS , 8 log R eS ), A(logr(TO*),crio g r)}- (4) 

Thus we compute for each galaxy the probabilities Pet 
and Plt for the respective size-mass distribution models 
for the early-type and late-type populations. Incomplete- 
ness terms should formally be included in these proba- 
bilities (as described by, e.g., Huang et al. 2013), but due 
to our conservative sample selection (see §2.4) we are not 
biased against faint, large objects. 

The uncertainty in size, 51ogP e ff, is computed as out- 
lined by (van der Wei et al. 2012). A random uncertainty 
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Figure 5. Size-stellar mass distribution of late- and early-type galaxies (same symbols as in Figure 2). A typical 1 a error bar for individual 
objects in the higher-redshift bins is shown in the bottom-right panel. The lines indicate model fits to the early- and late-type galaxies as 
described in §3.1. The dashed lines, which are identical in each panel, represent the model fits to the galaxies at redshifts 0 < z < 0.5. 
The solid lines represent fits to the higher-redshift samples. The mass ranges used in the fits are indicated by the extent of the lines in the 
horizontal direction. Strong evolution in the intercept of the size-mass relation is seen for early-type galaxies; moderate evolution for the 
late-type galaxies (also see Figure 6. There is no significant evidence for evolution in the slope (also see Figure 6). The parameters of the 
fits shown here are given in Table 1. 


of 0.15 dex in m» is included in our analysis by treating 
it as an additional source of uncertainty in R e g' for a 
size-mass relation with a given slope, an offset in m* 
translates into an offset in R e g. Hence, the calculation 
of P stays 1-dimensional. The fiducial slopes we use to 
convert <nogi? e ff into <5m* are a = 0.7 for early-type 
galaxies and a = 0.2 for late- type galaxies. 

We also take into account the mis-classification of 
early- and late-type galaxies. Despite the bimodal distri- 
bution in the color-color diagram (§2.4; Figure 1) there 
are galaxies in the region between the star-forming and 
quiescent sequences, making their classification rather ar- 
bitrary, causing cross-contamination of the two classes 
(also see Holden et al. 2012). Motivated by this work, 
we take this mis-classification probability to be 10%. We 
will comment on the effects of varying this parameter 
below, when we describe the fitting results. 

The mis-classification probability precisely corre- 
sponds to the early- and late-type contamination frac- 
tions in a sample in case the two sub-samples have equal 
numbers of galaxies. The actual contamination fraction 
scales with the ear ly-/late- type fraction, which depends 
on galaxy mass and redshift. The evolution of the stellar 
mass function for the two types is described by Muzzin 
et al. (2013), which we use here to compute this ratio. 
We also allow for 1% of outliers: these are objects that 
are not part of the galaxy population, for example, catas- 
trophic redshift estimates or misclassified stars. Finally, 


in order to avoid being dominated by the large number of 
low-mass galaxies, we also assign a weight to each galaxy 
which is inversely proportional to the number density. 
This ensures that each mass range carries equal weight 
in the fit. The number density is taken from the Muzzin 
et al. (2013) mass functions. 

Then, we compute the total likelihood for a set of six 
model parameters (intercept A, slope a, and intrinsic 
scatter a\ og # eff , each for both types of galaxies) : 


-ET 


= ^ln \w- ((1 — C) ■ Pet + C ■ Pur + O.Ol) 1 (5) 


for early-type galaxies, and 

Cut = ln [ w ' ((! - C) ■ P LT + C- P E t + 0.0l)] (6) 


for late-type galaxies, where W is the weight and C is 
the contamination fraction, both of which are a function 
of redshift and mass. The best-fitting parameters are 
identified by finding the model with the maximum total 
likelihood, C = £et + £lt- 
For the late types we fit all galaxies with M* > 
3 x 10 9 M 0 ; this limit provides a good dynamic range of 
two orders of magnitude in mass, and exceeds the mass 
limit of our sample up to z = 2.5 (Figure 2). For the 
early types we fit all galaxies with M» > 2 x 10 10 M 0 , so 
that we avoid the clearly flatter part of the size-mass dis- 
tribution at lower masses (see §3.2). This cut-off exceeds 
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Figure 6. Parametrized redshift evolution of the size-mass relation, from the power-law model fits shown in Figure 5. The left-hand panel 
shows the evolution of the intercept, or the size evolution at fixed stellar mass of 5 X 10 10 Mq. Strong evolution is seen for high-mass 
early-type galaxies; moderate evolution for low-mass early types and for late-type galaxies. The middle and right-hand panels show the 
evolution of the slope and intrinsic (model) scatter of the size-mass relation, either with little or no evidence for changes with redshift. The 
open symbols represent the observed scatter: these measurements do not take measurement uncertainties and contamination into account. 
The fitting parameters shown in this figure are given in Table 1. 



Figure 7. Evolution-corrected average sizes at M* = 5 X I A 1 11 Mq 
for late-type galaxies (top panel, in blue) and early-type galaxies 
(bottom panel, in red) . The values shown here are the values shown 
in the left-hand panel of Figure 6, divided by (l + z)^ 2 as indicated 
on the y-axis. The residuals from the best-fitting (1 +z)@ z law indi- 
cate that parameterizing the evolution as a function of the Hubble 
parameter (7J e fj oc h(z )^ n ) may provides a more accurate descrip- 
tion for the late-type galaxies. See §3.2 for further discussion. 

the mass limit of our sample up to 2 = 3. 

The black lines in Figure 5 indicate the fitting results, 
and the evolution of the individual model parameters 
(intercept, slope and scatter) are shown in Figure 6. The 
fitting results are also given in Table 1. The intercept 
of the best-fitting size mass model distributions evolves 
significantly with redshift, and particularly rapidly for 
the early types. 

Usually, the evolution of the intercept is parametrized 
as a function of (1 + z). While this is intuitively ap- 
pealing because of our familiarity with the cosmologi- 


cal scale factor, this is perhaps not the physically most 
meaningful approach. Galaxy sizes, in particular disk 
scale lengths, are more directly related to the properties 
of their dark matter halos than to the cosmological scale 
factor. Halo properties such as virial mass and radius 
follow the evolving expansion rate - the Hubble param- 
eter H(z ) - instead of the cosmological scale factor. For 
a matter-dominated universe, H(z) and (1 + z) evolve at 
a similar pace, but as a result of the increased impor- 
tance at late times of A for the dynamical evolution of 
the universe, H(z) evolves much slower in proportion to 
(1 + z) at late times than at early times. For example, 
at z ~ 0 we have H(z) oc (1 + z) 0A , while at at 2 ~ 2 
this is H(z) oc (1 + z) 1A . 

For this reason it is reasonable to parametrize size evo- 
lution as a function of H(z ) in addition to (1 + z). The 
solid lines in the left-hand panel of Figure 6 represent the 
evolution as a function of ff(z), while the dashed lines 
represent the evolution as a function of (1 + z). These re- 
sults are also given in Table 1. The H(z)^ H parametriza- 
tion is marginally preferred by the data over the (l + z)P* 
parametrization, as is more clearly illustrated in Figure 
7, where we show the residuals. In addition to the statis- 
tical limitations, we note that these residuals are of the 
same magnitude as the systematic uncertainties in the 
size measurements and color gradient corrections (§2.5). 
A more thorough comparison with size evolution of larger 
samples at z < 1 with size measurements at visual wave- 
lengths would improve these constraints. 

Newman et al. (2012) first demonstrated the lack of 
strong evolution in the slope of the size-mass relation for 
massive (> 2 x 10 10 Mq) early-type galaxies. Here, we 
confirm that result (middle panel, Figure 6), and find 
a slope of R e ff oc M °- 75 at all redshifts. This slope is 
somewhat steeper than measured by Shen et al. (2003) 
for present-day early-type galaxies. Differences in sam- 
ple selection (star-formation activity vs. concentration) 
and methods ( R e g from Sersic profile fits vs. Petrosian 
half-light radii) may explain this difference. For the 
first time we extend the analysis to late-type galaxies: 
the slope is much flatter than the slope for early types 
(i? e ff oc M 0 - 22 ), with little or no change with redshift. 
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Figure 8. Median (points) and 16th and 84th percentiles (lines) of the size-stellar mass distributions shown in Figure 5. The scatter in 
R e g does not strongly depend on galaxy mass. Deviations from the power-law form of the size-mass relation are clear for massive late-type 
galaxies and for low-mass early-type galaxies. Note that here we do not account for contamination (misclassified early- and late-type 
galaxies) . 


This slope is intermediate to the slope found by Shen 
et al. (2003) for low- and high-mass galaxies. Our sam- 
ple contains too few high-mass late-type galaxies to per- 
form a robust double-component power-law fit, as done 
by Shen et al. (2003), but in §3.2 we will show evidence 
for the steepening of the relation for massive late-type 
galaxies out to z = 1. 

Finally, we present the first measurement of the intrin- 
sic scatter in size beyond the local universe (right-hand 
panel of Figure 6). We find no strong evolution for ei- 
ther late types or early types, and that the scatter for the 
early- type population is always somewhat smaller (0.1 to 
0.15 dex) than for the late-type population (0.16 - 0.19 
dex). These numbers agree well with the intrinsic scatter 
measured by Shen et al. (2003) for present-day galaxies: 
0.13 dex for early-type galaxies, and 0.20 dex for late- 
type galaxies. We note that the effect of measurement 
uncertainties were not included by Shen et al. (2003). 

For comparison we show the observed scatter at each 
redshift, calculated as the standard deviation in R e g after 
subtracting the best-fitting size-mass relation. The val- 
ues for early-type galaxies are in the range 0.2-0. 3 dex, 
in good agreement with the values found by (Newman 
et al. 2012) over the same redshift range. In particular, 


the strongly increased observed scatter in size for the 
early-type galaxies at £ > 2 is largely attributed to sig- 
nificant contamination by mis-classified late-type galax- 
ies. We have assumed a mis-classification probability of 
10% (resulting in an assumed mis-classified fraction of 
C = 0.10 in the case of equal numbers of early- and late- 
type galaxies - see above) , but although this value is em- 
pirically motivated it is not known with great precision. 
If we decrease (increase) the mis-classification probabil- 
ity to 5% (20%) then the recovered intrinsic scatter for 
the z = 2.0 — 2.5 early-type galaxy sample, for example, 
increases (decreases) to 0.18 (0.11). 

At this point we should also comment on the effect of 
changing the value for the assumed random uncertainty 
in stellar mass (here, 0.15 dex). Decreasing its value 
has no measurable effect, while increasing it to 0.30 dex 
decreases the recovered value for the intrinsic scatter fur- 
ther, to 0.05 dex for the 2 = 2.0 — 2.5 early-type galaxy 
sample. In this sense, the derived values for the intrinsic 
scatter are upper limits. 

While our particular choices in modeling the uncertain- 
ties affect the results with (marginal) significance, they 
do not affect our general conclusions that the intrinsic 
size scatter 1) is <0.20 dex for both types of galaxies, 
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Figure 9. Median size as a function of stellar mass and redshift for early-type galaxies (left) and late-type galaxies (right). SDSS 
data points based on (Guo et al. 2009) are shown as open points. Fits to the median sizes of the form -R e ff/kpc = B z (l + z)^ z and 
B H (H(z)/ H o)Pn are shown by dotted and solid lines, respectively. The evolution of the early-type galaxies is independent of mass at 
M* > 2 x 10 10 Mq: massive galaxies evolve fast and have a steep size- mass relation at all redshifts, while the relation flattens out at lower 
masses (< 10 10 Mq and evolves less rapidly. The evolution of the late-type galaxies is overall slower, and does not depend strongly on 
mass. The low-mass early-type galaxies evolve at the roughly the same pace as the late-type galaxies. The median sizes and fitting results 
are given in Table 2. 


and 2) does not strongly evolve with redshift. However, 
the conclusion that the scatter for early-type galaxies is 
smaller than for late-type galaxies at all redshifts - as 
is seen for present-day galaxies - should at this stage be 
regarded as tentative. 

Finally, we note that changes in the mis-classification 
probability or quncertainty in stellar mass do not sig- 
nificantly affect the recovered values of the other model 
parameters (zero point and slope). 

3.2. Evolution of median sizes 

In this section we offer a complementary description 
of the evolution of the size-mass relation. In Figure 8 
we show the median sizes as a function of mass and red- 
shift, along with the 68th percentile width of the size 
distribution. The values are listed in Table 2. Up to 
z ~ 1.5 the relation for late types steepens and tight- 
ens at the high-mass end. Shen et al. (2003) modeled 
the steepening by assuming a two-component power law, 
but we sample an insufficiently large volume and sample 
size at > 10 11 Mq to include this in our analytical de- 
scription presented above (§3.1). The flattening of the 
size-mass relation for low-mass early types is also clearly 
seen. Inspection of the SEDs of individual galaxies con- 
firms that these are truly quiescent galaxies, with strong 
4000 A breaks. As we showed in §3.1, the large apparent 
increase in the scatter for high-redshift early types can 
be partially attributed to contaminants and outliers. 

We provide complementary sets of median size and 
scatter measurements in the Appendix. These in- 
clude the commonly used circularized radii: I? e ff,circ = 
R e s^/bja, where b/a is the projected axis ratio. In ad- 
dition, we provide the measurements for the combined 
late+early-type galaxy sample, and the measurements 


in bins of rest-frame U-band luminosity. 

Figure 9 shows the median size evolution for galaxies 
in different mass bins. We parametrize this evolution 
both as a function of H(z) and of (1 + z) - see §3.1. The 
results are shown as solid and dotted lines, respectively, 
in Figure 9, and are also given in Table 2. 

Ideally, an immediate comparison with the size-mass 
distribution of nearby galaxies provides a strong con- 
straint on the evolution. However, such comparisons are 
fraught with systematic uncertainties. The aim here is 
merely to show that our observations from CANDELS 
and 3D-HST are consistent with the size-mass relation 
for nearby galaxies as measured from the SDSS (Shen et 
al. 2003), who provided the standard reference for this 
purpose. 

In order to account for possible systematic differences 
we compare the size measurements from Shen et al. 
(2003) with those from Guo et al. (2009) on an object-by- 
object basis. The reason for using the Guo et al. (2009) 
measurements as a baseline is that they are based on the 
same technique GALAPAGOS from Barden et al. (2012) - 
as used in this paper. We shift the analytical descriptions 
for the size-mass relations from Shen et al. (2003) accord- 
ing to the measured, systematic offset between Shen et 
al. (2003) and Guo et al. (2009). This amounts to a 0.1 
dex shift to larger R e s compared to Shen et al. (2003) 19 . 
In order to be conservative we also adopt 0.1 dex as the 
systematic uncertainty. We show the inferred sizes for lo- 
cal galaxies in Figure 9. The median size evolution traced 
out by the 3D-HST / CANDELS data predicts z ~ 0 sizes 
that are consistent with our adjusted Shen et al. (2003) 
median sizes. 

19 Note that we use major axis R e g in this paper, as opposed to 
Shen et al. (2003), who use circularized radii 
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Figure 10. Size distribution histograms for early- and late-type galaxies as a function of stellar mass (as labeled on the right-hand side) 
and redshift (as labeled at the top). The number of galaxies is given in units of co-moving volume to illustrate the growth of the population 
over time. The early-type size distributions are fit with Gaussians with a fixed dispersion of 0.16 dex. The late-type size distributions are 
fit with skewed Gaussians with a fixed dispersions of 0.18 dex and skewness /13 = —0.15. The panels with thin lines show samples which 
are below our mass limit. 
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Figure 11. Cumulative size distributions of ~ L* early-type 
galaxies (top) and ~ L* late-type galaxies (bottom) as a function 
of redshift. While the number density of both early- and late- type 
galaxies increases over time, the number density of small galaxies 
declines, implying that the observed evolution in the mean size is 
not (solely) driven by the addition of larger galaxies. Individual 
galaxies must evolve in size. 



Figure 12. Size evolution of galaxies in a narrow (0.3 dex) mass 
bin around lO 10 Mq. The black points represent UV-bright galax- 
ies (with U — V < 1 in the rest-frame), selecting a sample akin 
to LBGs at high redshift. Their size evolution is fast, consistent 
with the size evolution of UV-selected samples up to z = 7 as re- 
cently determined by Mosleh et al. (2012) - also see Oesch et al. 
(2010). The blue points represent late-type galaxies as defined in 
this paper (see Figure 1), that is, all star-forming galaxies. The 
size evolution of those is slower at low redshift, consistent with 
previous measurements at z < 1 (here, Barden et al. 2005). 

The picture provided by median size distributions is 


Number density of compact galaxies 



Figure 13. Number density evolution of small, massive early- 
type galaxies. In each redshift bin (with equal co-moving volume) 
we include early-type galaxies with mass M* > 5 X 1 0 1 11 Mq and 
size l? e ff(kpc) < (M*/5 X 10 10 Mq) 0,7 . That is, the slope of the 
size-mass relation is taken into account: for M* = 5 X 10 10 Mq 
the size limit is lt e £j = 1 kpc; for M* = 10 11 Mq the size limit 
is R e fj = 1.6 kpc. The number density first increases with cosmic 
time, reaching a plateau at z ~ 1.5 — 2, after which it strongly 
decreases toward the present day. The immediate implication is 
that individual galaxies must grow in size significantly, most likely 
through merging. 

consistent with our analytical description (§3.1), with 
fast evolution for the (massive) early types and moder- 
ate evolution for the late types. In addition, we see that 
the flattening of the relation for low-mass early types 
coincides with slower evolution. Interestingly, low-mass 
early-type galaxies evolve at a similar same rate as late 
types of the same mass. For the late types we see a mild 
dependence on mass: the more massive late types evolve 
slightly faster than the less massive late types. For fur- 
ther discussion, see §5. 

3.3. Skewness in the R e g distribution of late-type 
galaxies 

The 16- and 84-percentile range for late- type galaxies is 
not precisely centered on the median size (Figure 8 and 
Table 2), implying a skewness in the size distribution. 
To examine this further we show size distribution his- 
tograms for a set of mass and redshift bins in Figure 10. 
The asymmetric size distribution for late-type galaxies is 
due to a tail of small galaxies. The small sizes are not 
due to point-like contributions from AGN: the 1 — 10/nn 
photometry of these objects does not show the power- 
law spectral energy distributions that are characteristic 
for unobscured AGN. 

As a result of this skewness there is substantial over- 
lap between the size distributions of early types and late 
types and no clear bi-modality; this, despite large dif- 
ferences in the average sizes. Figure 11 shows - more 
clearly than Figure 10 - that the size distributions of the 
two types overlap at all redshifts, up to z ~ 3. 
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Table 1 

Results from the parametrized fits to the size-mass distribution of the form R e ff/kp c = A(M*/5 • 10 10 Mg)“, as described in §3.1 and 
shown in Figures 5 and 6. cr(logi? e ff) is the scatter in R e ff in logarithmic units. 



Early- Type Galaxies 

Late- Type Galaxies 

z 

log(A) 

a. 

crlog(ii eff ) 

A 

a. 

CTlog(fl e ff) 

0.25 

0.60±0.02 

0.75±0.06 

0.10±0.02 

0.86±0.02 

0.25±0.02 

0.16±0.01 

0.75 

0.42±0.01 

0.71±0.03 

0.11=60.01 

0.78±0.01 

0.22±0.01 

0.16±0.01 

1.25 

0.22±0.01 

0.76±0.04 

0.12±0.01 

0.70=60.01 

0.22±0.01 

0.17±0.01 

1.75 

0.09±0.01 

0.76±0.04 

0.14±0.01 

0.65±0.01 

0.23±0.01 

0.18±0.01 

2.25 

-0.05±0.02 

0.76±0.04 

0.14±0.02 

0.55±0.01 

0.22±0.01 

0.19±0.01 

2.75 

-0.06±0.03 

0.79±0.07 

0.14±0.03 

0.51±0.01 

0.18±0.02 

0.19±0.01 


Table 2 

Logarithmic size distribution (16% - 84% range) as a function of galaxy mass and redshift. The masses in 
the first column are the centers of 0.5-wide bins. Redshift dependence in the form R e ff/kpc = 

iWkpc = B H (H(z)/H 0 )fe are also given. 


the header and the redshifts in 
= B z (l + z)^ z and 


Z 

Early-Type Galaxies 

Late- Type Galaxies 

M* = 10 9 - 25 

9.75 

10.25 

10.75 

11.25 

9.25 

9.75 

10.25 

10.75 

11.25 

16% 
0.25 50% 

84% 

0.03±0.06 

0.27=60.02 

0.46±0.03 

0.04±0.03 

0.28±0.02 

0.46±0.04 

0.13±0.03 

0.38±0.03 

0.58±0.04 

0.42±0.02 

0.67±0.02 

0.92±0.05 

0.65±0.05 

0.76±0.09 

1.08±0.08 

0.24±0.01 

0.49±0.01 

0.70±0.01 

0.36±0.02 

0.61±0.02 

0.80±0.01 

0.42±0.02 

0.66±0.03 

0.85±0.02 

0.61±0.03 

0.83±0.03 

1.01±0.04 


16% 
0.75 50% 

84% 

-0.02=60.03 

0.23±0.01 

0.43±0.02 

-0. 14±0.02 
0.21=60.02 
0.44±0.02 

0.02±0.02 

0.23±0.01 

0.42±0.02 

0.26±0.01 

0.45±0.01 

0.64=60.02 

0.62±0.02 

0.81=60.02 

0.97=60.02 

0.18±0.01 

0.43±0.01 

0.65±0.01 

0.32±0.01 

0.56±0.01 

0.76±0.01 

0.39±0.01 

0.64±0.01 

0.83±0.01 

0.51±0.02 

0.75±0.01 

0.90±0.01 

0.77±0.02 

0.90±0.04 

1.12±0.03 

16% 
1.25 50% 

84% 


-0.15=60.03 

0.18±0.03 

0.42±0.04 

-0.15±0.02 

0.09±0.02 

0.36±0.03 

0.07±0.01 

0.30=60.01 

0.54=60.03 

0.41±0.02 

0.58±0.03 

0.81=60.03 

0 . 11 ± 0.01 

0.37±0.01 

0.60±0.01 

0.23±0.01 

0.48±0.01 

0.69±0.00 

0.33±0.01 

0.57±0.01 

0.77±0.01 

0.47±0.01 

0.67±0.01 

0.83±0.01 

0.62±0.03 

0.82±0.03 

0.96±0.02 

16% 
1.75 50% 

84% 


-0.02±0.06 

0.22±0.03 

0.48±0.06 

-0.27±0.02 

0.02±0.03 

0.35±0.03 

-0.04±0.02 

0.19±0.02 

0.50=60.03 

0.28±0.02 

0.45±0.02 

0.74±0.04 

0.07±0.01 

0.33±0.01 

0.57±0.01 

0.16±0.01 

0.42±0.01 

0.65±0.01 

0.28±0.01 

0.52±0.01 

0.72±0.01 

0.35±0.02 

0.61±0.01 

0.80±0.01 

0.53±0.04 

0.70±0.02 

0.87±0.02 

16% 
2.25 50% 

84% 



-0.37±0.08 

-0.04=60.07 

0.36±0.03 

-0.20=60.02 

0.08±0.03 

0.54=60.04 

0.16±0.03 

0.36±0.05 

0.55±0.07 


0.10=60.01 

0.35±0.01 

0.57±0.01 

0.17±0.02 

0.44±0.01 

0.64±0.01 

0.26±0.03 

0.53±0.01 

0.70±0.01 

0.40±0.02 

0.64±0.02 

0.84±0.02 

16% 
2.75 50% 

84% 




-0.22±0.05 

0.10±0.03 

0.50=60.10 

0.07=60.07 

0.39±0.08 

0.68±0.10 



0.16±0.02 

0.43±0.01 

0.65±0.02 

0.19±0.06 

0.47±0.01 

0.71±0.02 

0.33±0.04 

0.55±0.02 

0.76±0.04 

B z 

Pz 


0.29±0.01 

-0.22=60.02 

0.47±0.02 

-1.01±0.06 

0.75±0.04 

-1.24±0.08 

1.05±0.09 

-1.32=60.21 

0.54±0.01 

-0.48±0.03 

0.69±0.01 

-0.63±0.02 

0.74±0.03 

-0.52±0.08 

0.90±0.05 

-0.72±0.09 

1.05=60.08 

-0.80±0.18 

Bh 

P H 


0.27±0.01 

-0.19=60.02 

0.42±0.02 

-0.97±0.05 

0.68±0.03 

-1.13±0.06 

0.97=60.06 

-1.29=60.16 

0.52±0.01 

-0.52±0.02 

0.65=60.01 

-0.58±0.02 

0.71±0.03 

-0.49±0.07 

0.86±0.04 

-0.65±0.09 

1.01±0.06 

-0.76±0.13 
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Figure 10 also shows tails of large, early-type galax- 
ies. However, this can likely be attributed to mis- 
classification: their number is always an order magnitude 
smaller than the number of late-type galaxies with sim- 
ilar sizes, consistent with the assumed mis-classification 
probability in our analysis in Section 3.1. We note that 
the tail of small, late-type galaxies is not consistent with 
the expected number of mis-classificatied objects: its 
prominence appears to be unrelated to the early-type 
population. 

As an illustration that the size distribution for both 
early- and late-type galaxies evolve smoothly and reg- 
ularly, we fit Hermite-Gaussian functions to the his- 
tograms shown in Figure 10. This provides a reasonable 
description for all redshift and mass bins. 

For a discussion of the implications of these results in 
the context of previous results we refer to §4.2 and §5.2. 

4. COMPARISON WITH PREVIOUS RESULTS 
4.1. Late-Type Galaxies 

The results for size evolution of late-type galaxies 
shown in Figures 6 and 9 are consistent with most other 
recent measurements that focus on the z = 1 — 2 red- 
shift range (e.g., Franx et al. 2008; Buitrago et al. 2008). 
Williams et al. (2010) found somewhat faster evolution 
for late types (and slower evolution for early types) , but a 
direct comparison with their Figure 4 reveals that their 
and our size measurements are in fact fully consistent 
over the redshift range 0.5 < z < 1.5. The difference 
in the quoted pace of evolution is likely the result of the 
increased dynamic range in redshift probed here, in addi- 
tion to the use of a low-redshift comparison sample from 
the SDSS (see §3.2). 

Thus, like previous studies focusing on the 2 = 1 — 2 
redshift regime, we find that the pace of evolution for 
late-type galaxies is intermediate to the slow evolution 
of disk galaxies at z < 1 (Lilly et al. 1998; Ravindranath 
et al. 2004; Barden et al. 2005) and the fast evolution 
of UV-selected galaxies at 2 > 2 (Giavalisco et al. 1996; 
Ferguson et al. 2004; Oesch et al. 2010; Mosleh et al. 
2012). Our data set allows us to bridge these regimes 
and probe the origin of this difference. 

In Figure 12 we show the size evolution of galaxies 
with stellar mass M* ~ 10 10 Mq from the present day 
up to z ~ 6. Here we have relaxed our magnitude limit 
to F/fi 60 W — 26, which is still within the completeness 
limit of the CANDELS imaging as can be seen in Figure 
2. Size measurements of individual galaxies are no longer 
reliable at Hfi 60 W = 26, but the sample average is still 
robust to within 15% (van der Wei et al. 2012). For 
galaxies bluer than U— V = 1, thus selecting a population 
akin to LBGs we probe the population out to z ~ 6. The 
median size evolves quickly with redshift, R e s oc (1 + 
z)- 1 ' 1 , consistent with recent measurements by Oesch et 
al. (2010) and Mosleh et al. (2012). 

Once we include all late-type galaxies, regardless of 
color, the evolution matches that of the U — V < 1 galax- 
ies at z > 2. This is simply because essentially all galax- 
ies are blue: the color-blind sample is not biased against 
galaxies with U — V > 1 up to z = 3 — 3.5. At lower red- 
shift red star-forming galaxies appear, which are smaller 
in size than their blue counterparts and slow down the 
average size evolution. At z < 1 UV-bright galaxies are 


rare and the evolution is dominated by redder galaxies, 
which evolve in size more slowly in agreement with the 
results from, for example, Barden et al. (2005). 

As we argued in §3.2 galaxy sizes may be better 
parametrized as a function of H(z) than as a function of 
(1 + 2 ). While the former naturally implies slower evolu- 
tion at late times than the latter (see the red, dotted line 
in Figure 12) not all trends are captured by using the 
H(z) parametrization: 1) the evolution of all late types 
slows down more rapidly than can be explained by the 
difference between the two parametrizations; 2) the UV- 
bright sample shows little evidence for slowed evolution 
at 2 < 1. 

We conclude that the diverging pace of evolution seen 
at 2 < 1 and z > 3 as reported in the literature is in 
part due to sample selection effects and part due to the 
different evolution of red and blue late-type galaxies. 

4.2. Early-Type Galaxies 

As we discussed in the Introduction there is broad 
agreement that the average sizes of ~ L* early-type 
galaxies, as measured at a fixed stellar mass, were 
smaller at high redshift. Moderate deviations in the 
pace of evolution can be attributed to sample selection-, 
measurement- and/or fitting techniques. For example, 
as we mentioned above, whereas the size measurements 
reported by Williams et al. (2010) are consistent with 
our size measurements, the reported pace of evolution is 
somewhat different: (1+z) -1 from Williams et al. (2010); 
(1+2) -1 - 3 from §3.2 in this paper. This is the result of the 
difference in spanned redshift range and the different use 
of present-day comparison samples. While these differ- 
ences are large enough to be interesting, there is reason- 
able consensus that the average size for the population 
of early-type galaxies evolves rapidly. In particular, the 
first HST/NICMOS-based produced an impressive body 
of evidence for rapid evolution (Zirm 2007; Toft et al. 
2007; van Dokkum et al. 2008; Buitrago et al. 2008), later 
confirmed by HST/WFC3-based studies (e.g., Newman 
et al. 2012; Cassata et al. 2013). 

What has so far remained contentious is what drives 
this evolution: size evolution of individual galaxies, the 
addition of larger galaxies to the population, or a combi- 
nation. Figure 11 (top panel) shows the cumulative size 
histograms of early-type galaxies in the L* mass range, 
which reveals that the number density of small early-type 
galaxies strongly evolves with redshift. The total num- 
ber density of early-type galaxies increases from 2 ~ 3 
to the present, but the number of small galaxies strongly 
decreases at late cosmic times. We show this explicitly 
in Figure 13: the number density of high-mass galaxies 
with small sizes increases from early times to 2 ~ 2, levels 
off, and then decreases strongly at 2 < 1.5. The defini- 
tion of small is arbitrary, but the generally picture does 
not depend on the precise choice in mass and size range. 
This finding is in general agreement with previous claims 
based on smaller samples and fewer fields by Cassata et 
al. (2011), Newman et al. (2012), Szomoru et al. (2012), 
Buitrago et al. (2013), and Cassata et al. (2013). 

Several authors have argued that there are substantial 
numbers of small, yet massive galaxies in the present-day 
universe (Valentinuzzi et al. 2010; Poggianti et al. 2013). 
The latter show that 3 — 5% of present-day group and 
cluster early-type galaxies with mass M* > 3 x 10 10 Mq 
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can be classified as “compact”. Following their defini- 
tion (M*/(27ri? e ff iC i rc ) > 3 x 10 9 M 0 kpc -2 ) we find that 
~ 40 — 50% of z ~ 1 early-type galaxies qualify as com- 
pact. Since the total number density of such galaxies has 
evolved by no more than a factor 2 or 3 we conclude that 
most of the z ~ 1 compact galaxies no longer exist in 
that form in the present-day universe. Several ‘fossils’ 
in the form of R e g ~ 1 kpc, M* ~ 10 11 Mq galaxies 
are found in the local volume (see, e.g., van den Bosch 
et al. 2012; Dullo & Graham 2013, for recent examples), 
but their number density is too low to match the number 
density of their z = 2 counterparts. 

Recently Carollo et al. (2013) claimed that the number 
density of small early-type galaxies in the L* mass range 
has not strongly evolved since z ~ 1. We rule out that 
field-to-field variations explain the discrepancy with our 
results. All five fields show a decline in the number den- 
sity of compact galaxies (as defined in Figure 13, with 
R e s < 2.5 at 10 11 M 0 ) between z = 1.5 and 2 = 0.5, by 
factors ranging from 3 to 10. A decline of more than a 
factor 2 between z = 1 and z = 0.5 is seen for 4 out of 5 
fields. 

There are several factors, in the form of systematic 
effects in the size and mass measurements, that may 
contribute to this tension. Slight redshift-dependent 
shifts in the stellar mass estimates produce changes in 
the size distribution as measured at a fixed mass. Our 
stellar mass estimates for luminous early-type galaxies 
have been demonstrated to show small, if any, shifts 
(<0.1 dex) compared to dynamical mass measurements 
over the redshift range 0 < z < 2 (van de Sande et al. 
2013; Bezanson et al. 2013; Belli et al. 2014). In addition, 
our color-gradient correction would introduce a 14% shift 
oi z = \ sizes relative to z = 0.2 sizes in the Carollo et 
al. sample. 

Most importantly, the size measurements used here 
and by Carollo et al. (2013) are obtained with funda- 
mentally different techniques: here we use parametrized 
profile fits, while Carollo et al. use growth curves. The 
growth-curve method does not take the PSF into account 
at the time of measurement, but relies on a posteriori 
correction. The magnitude of the correction depends on 
the intrinsic structural properties and is inferred from 
simulated size measurements. For example, galaxies with 
measured sizes of ^0.2" receive a negligible correction, 
whereas galaxies with measured sizes of ~0.1" receive a 
factor 2 correction downward (see Figure 2 of Carollo 
et al.). With such strongly size-dependent corrections it 
is difficult to reconsruct the true size distribution, espe- 
cially when those corrections are of similar magnitude as 
the sizes themselves. 

In an explicit example in which we apply a systematic, 
redshift-dependent shift in R e g of order 0. 1-0.2 dex per 
unit redshift we infer non-evolution in the number den- 
sity of compact galaxies. Given this sensitivity to small 
shifts in size we argue that our measurements, which do 
not require systematic size corrections of more than a few 
percent (see Section 2.5 and van der Wei et al. (2012)), 
represent the size distribution with good fidelity across 
the examined redshift range. 

5. DISCUSSION 

5.1. Evolution of Late- Type Galaxies 


Remarkably, the observed pace of size evolution for 
late-type galaxies is essentially the same as the evolu- 
tion of the dark matter halo radius at a fixed mass, 
R oc H(z )~ 2 / 3 , but only when halo mass and radius are 
defined with respect to the critical density. In a ACDM 
universe, if halo mass is parametrized with respect to 
matter density or virial density (assuming top-hat col- 
lapse), then A causes strong departures from a power 
law at late cosmic times. The average evolution between 
z = 2 and the present is R oc H{z)~ lm or oc H(z)~ 124 , 
respectively (Peebles 1980). 

The interpretation of such a comparison is not straight- 
forward. However, our novel measurement of the slope 
and scatter of the size mass relation provides new con- 
straints. The intrinsic scatter in galaxy size remains ap- 
proximately the same at all redshifts (~ 0.16 — 0.19 dex, 
see Figure 6) and is comparable to, but perhaps some- 
what smaller than, to the scatter of 0.25 dex in the halo 
spin parameter (e.g., Maccio et al. 2008). This strongly 
suggests that at all redshifts the sizes of late-type galax- 
ies are set by their dark matter halos, and encourage us 
to examine the relation between galaxy sizes and halo 
properties further. 

The power-law fits presented in §3.1 imply that there 
is very little or no evolution in the slope of the size- 
mass relation; it remains flat, a = d log R e g/d log M* = 
0.22 ± 0.03, at all redshifts 0 < z < 3 (Figure 6, mid- 
dle panel). As argued by Shen et al. (2003), the flat 
slope suggests that the ratio between galaxy mass and 
halo mass is not a constant: if it were, the size-mass 
relation would be steeper (a = 1/3) than observed. 
The underlying assumption is that galaxy size is propor- 
tional to halo size (Kravtsov 2013), which we here take 
to be the case for late-type galaxies. Following Shen 
et al. (2003), we use the observed slope ( a ~ 1/5) to 
constrain the galaxy mass-halo mass relation and find 

m g = Mg a i/Mhaio oc The observation that the 

slope of the size-mass relation does not evolve with red- 
shift provides a very stringent constraint on the models: 
unless a combination of factors conspire to keep this slope 
constant, the most straightforward explanation is that 
the slope of the relation between galaxy and halo mass 
(7) is similar across the redshift range considere here. 

Indeed, entirely independent estimates of the relation- 
ship between galaxy and halo mass, based on clustering 
measurements and abundance matching techniques, pro- 
vide strong evidence that m g depends on halo mass, sim- 
ilarly so at different redshifts (e.g., Conroy & Wechsler 
2009; Moster et al. 2010; Behroozi et al. 2010; Wake et al. 
2011; Moster et al. 2013; Behroozi et al. 2013a). In fact, 
the most recent studies found that 7 = 2/3 for halos in 
the mass range Mhaio ~ 10 11-12 Mq, in agreement with 
what we infer based on the slope of the size-mass rela- 
tion. In addition, Moster et al. (2013) and Behroozi et 
al. (2013a) showed that m g peaks at a similar halo mass 
(~ 10 12 Mq) at all redshifts z < 2, at around a constant 
value of m g ~ 0.05. The implication is that m g does not 
strongly evolve over the (rather narrow) halo mass range 
10 n “ 12 Mq (Behroozi et al. 2013b). 

It is unclear whether the observed pace of galaxy size 
evolution (R e g oc H(z)~ 2 ^ 3 ) implies that R e g evolves 
in proportion to Rhaio as may be expected in the case 
that galaxy size scales with halo size in the present-day 
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universe (Kravtsov 2013): it may be a coincidence that 
the observed pace of evolution is the same as that for halo 
radii with respect to the critical density: it appears more 
natural to expect galaxy sizes to scale with halo mass 
and radius that are defined in terms of matter or virial 
density. In this spirit, the tendency for late-type galaxies 
to display rather slower size evolution than expected has 
been given ample attention in the literature. 

Somerville et al. (2008) argued that because halos 
are less concentrated at high redshift baryonic disks are 
larger in proportion to the virial radii of halos, leading to 
slower size evolution. In addition, Dutton et al. (2011) 
showed that accreting, gaseous disks with a simple, but 
self-consistent prescription for star formation lead to sim- 
ilarly slow evolution of the stellar disk scale radius as a 
result of recycling of gas and radial variations in star for- 
mation. In addition, stellar feedback may have a more 
direct impact on disk sizes as low-angular momentum 
material is ejected (e.g., Mailer & Dekel 2002; Brook et 
al. 2011). 

The sizes we measured are the not strictly disk scale 
lengths, as we sample the whole galaxy, including the 
bulge. Therefore, bulge formation in late-type galaxies 
slows down size evolution as parametrized here. Bulge 
formation can either occur rapidly, through mergers 
(Toomre & Toomre 1972) or clump formation and mi- 
gration in unstable disks (Dekel et al. 2009; Ceverino et 
al. 2010), or gradually, through secular evolution driven 
by non-axisymmetries in the disk potential (Kormendy & 
Kennicutt 2004). The prediction of any of these scenar- 
ios is that the galaxies with higher global Sersic indices 
will have smaller sizes at a given mass. The observation 
that evolution is faster at z > 2 and slower at 2 < 1 
(see §4.1 and Figure 12), combined with the appearance 
of redder, more compact galaxies at late cosmic times, 
suggests that bulge formation plays an important role in 
the evolution of half-light radii of late-type galaxies. 

5.2. Evolution of Massive Early-Type Galaxies 

The co-moving number density of L* early types has 
strongly increased over the redshift range examined in 
this paper (0 < 2 < 3), as was shown by, e.g., Bell et al. 
(2004); Faber et al. (2007); Brown et al. (2007); Ilbert et 
al. (2010); Brammer et al. (2011); Buitrago et al. (2013); 
Muzzin et al. (2013). Here, this is illustrated in Fig- 
ures 10 and 11. Naturally, the progenitors of the newly 
formed early-type galaxies must be looked for among the 
star-forming, late-type population. The skewed size dis- 
tribution of late types toward small sizes (see §3.3 and 
Figure 10) points at the existence of a population of small 
late-type galaxies that span the entire size range seen for 
early- type galaxies. Figure 11 illustrates that this is the 
case over essentially the entire redshift range probed by 
our sample. 

The tail of small, star-forming galaxies shown in Fig- 
ure 11 at z > 1.5 (also see Barro et al. 2013; Williams 
et al. 2014; Barro et al. 2013) may reflect the intriguing 
possibility of a scenario in which such small, yet massive 
star-forming galaxies are the immediate progenitors of 
compact early-type galaxies. Their number density does 
not rapidly change between 2 = 3 and 2 = 1.5, whereas 
the number of early-type galaxies does rapidly increase 
over that redshift range (see Figure 11). This would sug- 
gest the continuous emergence of additional, small late- 


type galaxies that represent a transitional phase between 
the bulk of the late-type population and the early-type 
population, as recently advocated by Dekel & Burkert 
(2014) based on analytical arguments and simulations. 

An alternative interpretation is that the star-forming 
population consists of ‘normal’ late types and a pop- 
ulation of early-type galaxies that revived their star- 
formation activity. The simplest implementation of this 
model, in which these ‘frosting’ early types have the same 
size distribution of the quiescent early types, can be ruled 
out: the skewed size distribution of late types is not 
well described by two log-normal distributions centered 
at the respective peaks of the size distributions for late- 
and early-type galaxies. In general, the size distribution 
of the full population of galaxies (early and late types 
combined) is not observed to be bimodal in the sense 
that there is no clear gap between two fiducial popula- 
tions of small and large galaxies; nor can the size dis- 
tribution be accurately represented by a single Gaussian 
distribution. More complicated models of the ‘frosting’ 
flavor, in which a large, star-forming disk re-assembles 
to surround a compact, quiescent component cannot be 
immediately ruled out. However, such scenarios seem 
implausible as the implied color and mass-to-light ratio 
gradients of such galaxies would likely be stronger than 
observed (Wuyts et al. 2012). Measurements of the stel- 
lar density in the central regions of early- and late-type 
galaxies can be used to provide further constraints. 

Whether or not the small late-type galaxies represent 
a transitional phase, the central idea in the formation of 
early-type galaxies is that the formation of an early-type 
galaxy occurs requires the formation of a concentrated 
stellar body with a high density (e.g., Franx et al. 2008; 
Bell et al. 2012). One possibility is that a substantial 
amount of material flows to the center under the influ- 
ence of mergers (e.g., Di Matteo et al. 2005) or violently 
unstable disks and clump formation/migration (Dekel et 
al. 2009; Ceverino et al. 2010; Dekel & Burkert 2014). It 
remains to be seen whether such processes can reproduce 
the correct stellar density profiles (Wuyts et al. 2010). 

As we showed in §4.2 and Figures 11 and 13 the num- 
ber density of small, compact early-type galaxies strongly 
decreases between 2 ~ 1.5 and the present. This imme- 
diately implies that early-type galaxies, after they first 
form as compact, quiescent objects, have to substantially 
grow in size over time. Combining this with the sugges- 
tion that new early-type galaxies likely form out of the 
smallest late-type galaxies, the implication is that early- 
type galaxies are the most dense (and disk-like in struc- 
ture, e.g., van der Wei et al. 2011; Bruce et al. 2012; 
Chang et al. 2013) immediately after their star forma- 
tion is truncated. The amount of later evolution in size 
and density is dictated by the (non-evolving) slope of the 
size-mass relation and the evolution of its intercept. This 
naturally fits into the general idea that a gas-rich forma- 
tion phase is followed by a more quiescent, dissipationless 
formation phase. 

The scatter in the size-mass relation of ~ 0.15 — 
0.20 dex (see Figure 6) shows that there is some variation 
in amount of dissipative and dissipationless formation, 
yet, the fact that we see little or no evolution in the size 
scatter implies that the amount of dissipation integrated 
over cosmic history does not vary greatly; some early- 
type galaxies may have experienced an intensely dissi- 
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pative phase at early times, while other, but similarly 
massive, galaxies may have continued a less intense star- 
forming phase up until recently. The compact z > 1.5 
early-type galaxies would fall in the former category; the 
large, massive star-forming galaxies at z ~ 0.5 — 1.5 may 
be the progenitors of galaxies in the latter. 

Within this framework, independent evidence for the 
increase in stellar mass of individual early-type galaxies 
by a factor of 2 — 3 between z = 2 and the present (van 
Dokkurn et al. 2010) implies that the growth in size de- 
pends on the growth in mass as A R e g oc AM~ 2 . This 
steep dependence is consistent with a merger scenario. 

Satellite galaxies can be stripped and their stars de- 
posited on large-radius orbits. Direct and stringent con- 
straints on the minor merger rate are difficult to obtain, 
but it has proved to be difficult to observationally confirm 
a sufficiently large minor merger rate to explain the ob- 
served evolution (Newman et al. 2012). Mergers among 
galaxies that occupy the size-mass relation for early-type 
galaxies, that is, pure dry mergers, may not occur at suf- 
ficient rates (e.g., Nipoti et al. 2012). 

Alternatively, mergers between similarly massive 
galaxies with different sizes can induce large changes in 
the size-mass distribution of the population. Assuming 
that the size distribution of progenitors partaking in ma- 
jor mergers is the same as that of the population as a 
whole, a R e g = 1 kpc early-type galaxy at z ~ 2 will 
merge, on average, with a late-type galaxy that is 3 times 
larger. The properties of the merger remnant will de- 
pend on the amount of dissipation and the dynamics of 
the merger, but it is conceivable that the remnant will be 
much larger than the compact progenitor. A dense inner 
region will remain in place, and the strong correlation 
between central density and quiescence implies that the 
remnant is likely to be quiescent as well. 

The mass ratio distribution in the merger history of 
early-type galaxies, and its effect on size evolution, will 
remain a topic of debate. However, merging can account 
for, and is arguably required to explain, the disappear- 
ance from z ~ 2 to the present of disk-like structures 
among L* early types (van der Wei et al. 2011; Bruce 
et al. 2012; Chang et al. 2013), and the observation that 
the most massive galaxies in the present-day universe do 
not have a disk-like structure, but are intrinsically round 
(van der Wei et al. 2009). A combined analysis of the 
evolution of size and morphological properties (see, e.g., 
Huertas-Company et al. 2013) will aid to simultaneously 
interpret size growth and disk destruction. 

The above narrative shows that we have gathered a 
plausible set of mechanisms that may play a role in ex- 
plaining the formation and subsequent evolution of early- 
type galaxies. Despite this we lack the basis of a simple 
analytical framework that is similar to our model for disk 
formation. However, we note that the rapid pace of size 
evolution is very close to the size evolution expected for 
halos as defined by their virial density: R oc H{z)~ 12i 
for halos compares well with R e g oc H(z)~ 129 for mas- 
sive early-type galaxies. If we assume that these galaxies 
only grow through the accretion of other halos and their 
stellar content, then if is perhaps not a coincidence that 
halos and galaxies both follow the evolutionary path ex- 
pected for a dissipationless, top-hat collapse scenario. 


5.3. Evolution of Low-Mass Early-Type Galaxies 

As we noted in §3.2 the slope of the size-mass rela- 
tion for early-type galaxies flattens below stellar mass 
~ 10 10 Mg, and the size evolution is more comparable 
to that of late types than that of early types (see Figure 
9) . This suggests that there is a population of low-mass 
early types that may have formed out of late-type galax- 
ies without going through a transitional phase in which 
high central densities are attained. Stripping of gas from 
satellite galaxies is a natural explanation for such evo- 
lution, and can explain the existence of an excess popu- 
lation of early-type galaxies in clusters that have struc- 
tural properties similar to those of late-type galaxies in 
the field (van der Wei et al. 2010). Satellites are com- 
mon in this mass range in the present-day universe (e.g., 
van den Bosch et al. 2008), but not at higher redshifts, 
lending the stripping scenario more credence based on 
the rapid increase in the co-moving number density since 
z ~ 1—1.5 (see Figures 5 and 10). 

On the other hand, the early types with mass < 
10 10 Mq are ~ 2 times smaller than equally massive 
late types. Disk fading may contribute to this difference, 
but bulge formation and, in general, processes that cause 
more massive galaxies to transform into early-type galax- 
ies, may play a role in the low-mass regime as well. A 
model such as that presented by Peng et al. (2010) can be 
expanded in order to separately reproduce the size-mass 
relations for different types of ‘quenched’ galaxies. 

6. SUMMARY 

In this paper we present the size-mass distribution of 
30,958 galaxies over a large range in mass (> 10 9 M 0 ) 
and redshift (0 < z < 3), distinguishing between early- 
type and late-type galaxies based on their star-formation 
activity. Spectroscopic and photometric redshifts, stel- 
lar masses and rest-frame properties are determined us- 
ing data from the 3D-HST survey and auxiliary, multi- 
wavelength photometric data sets spanning from the U 
band to 8/rm (see §2). Galaxy sizes are measured from 
CANDELS imaging by single-component Sersic profile 
fits to two-dimensional light distributions, with a correc- 
tion for (redshift-dependent) color gradients (§2.5). 

Consistent with previous results, we find that high- 
redshift galaxies are substantially smaller than equally 
massive, present-day counterparts. As is shown in Fig- 
ures 5, 6 and 9, late-type galaxies are, on average, a 
factor ~ 2 smaller at z = 2 than at the present-day, 
whereas for massive early-type galaxies this is a fac- 
tor of ~ 4. We find that the size evolution of late- 
type galaxies is marginally better described as a func- 
tion of the redshift-dependent Hubble parameter, H(z), 
than as a function of the scale factor, 1 + z (Figure 
7). Average, mass-matched sizes of late- and early-type 
galaxies evolve as R e g ex H{z)~ 0 66 oc (1 + z)~' 75 and 
Reg oc H{z)~ 129 oc (1 + z) -1 ’ 48 , respectively (Figure 6 
and Tabic 1). 

High-mass late-type galaxies evolve marginally faster 
than low-mass late- type galaxies (Figure 9 and Table 
2), but the data are consistent with no evolution in 
the overall slope of the size-mass relation. At all red- 
shifts z < 3 we find that the slope is shallow for late- 
type galaxies (R e g oc M 9 22 for galaxies with stellar 
mass M* > 3 x 10 9 M 0 ), and steep for early-type 
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galaxies (R e s oc M t 0,75 for galaxies with stellar mass 
M* > 2 x 10 10 Mq). The size-mass relation for lower- 
mass early-type galaxies is more similar to that of late 
types than that of high-mass early types (§5.3). Once 
cross-contamination between the two classes of galaxies 
and outliers are taken into account (Figure 6 and §3.1), 
we also find no evidence for evolution in the (intrinsic) 
size scatter at a fixed galaxy mass. The implications of 
these results are discussed in §5. 

The data presented here are consistent with essentially 
most published data sets (§4). Because of the sample size 
and dynamic range in mass and redshift, the immedi- 
ate implications of the measurements are less ambiguous 
than was the case for previous studies. In particular, we 
show in Figure 11 that the size distribution of £ ~ 2 
early-type galaxies is significantly different from that of 
any subset of low-redshift galaxies with the same co- 
moving number density; small early-type galaxies, which 
are typical at z ~ 2, do not exist in equal numbers to- 
day (Figure 13) and must therefore undergo strong size 
evolution in the intervening time. 

The size-mass distributions from the 3D-HST and 
CANDELS projects presented here provide a solid frame- 
work for galaxy evolution models. The steadily evolving 
intercept of the size-mass relation, in combination with 
the non-evolving slope and scatter, present tight con- 
straints on how baryons condense and form galaxies at 
the centers of dark matter halos (e.g., §5.1). The different 
assembly mechanisms of early- and late-type galaxies act 
similarly at all redshifts, as evidenced by the very differ- 
ent, but unchanging slopes of their respective size-mass 
relations. 

This work is based on observations taken by the CAN- 
DELS Multi-Cycle Treasury Program (PI: Faber) and 
the 3D-HST Treasury Program (PI: van Dokkum) with 
the NASA/ESA HST, which is operated by the Asso- 
ciation of Universities for Research in Astronomy, Inc., 
under NASA contract NAS5-26555. 
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APPENDIX 

A. COMPLEMENTARY SIZE DISTRIBUTIONS 

Throughout this paper we use the radius R e g as measured along the major axis. Circularized sizes circ = 
R e s^/bJa, where b/a is the projected axis ratio) have often been used in the literature. For this reason we provide 
the circularized size distributions for the early- and late-type samples in Table Al. We stress that since galaxies are 
predominantly oblate, that is, disk-like, using R e g instead of -R e ff,circ is more prudent: i? e ff is (almost) independent of 
inclination, while i? e ff,circ depends on the short, projected axis, which obviously strongly varies with inclination. 

Throughout the paper we distinguish between late- and early-type galaxies on the basis of star formation activity. 
For some purposes it may be more useful to work with the size distributions of the full sample, without separating by 
type. In Table A3 we provide the size distributions of the full sample. 

Finally, since stellar mass is a derived, model-dependent quantity that is potentially suffering from large systematic 
errors, one might be interested in galaxy size as a function of luminosity, which is essentially a directly observed 
quantity. In Table A3 and A2 we provide the size distributions as a function of rest-frame V-band luminosity. 
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Table Al 

Logarithmic size distribution (16% - 84% range) as a function of galaxy mass and redshift. Identical to Table 2, but with circularized 

sizes instead of semi-major axis sizes. 


z 

Early-Type Galaxies 

Late-Type Galaxies 

M* = io 9 - 25 

9.75 

10.25 

10.75 

11.25 

9.25 

9.75 

10.25 

10.75 

11.25 

16% 
0.25 50% 

84% 

-0.07±0.08 

0.17±0.01 

0.31±0.03 

-0.05±0.04 

0.17±0.03 

0.39±0.05 

0.02±0.03 

0.24±0.04 

0.48±0.04 

0.29±0.04 

0.57±0.03 

0.84±0.05 

0.72i0.07 

l.OliO.ll 

0.07i0.01 

0.30i0.01 

0.54i0.01 

0.21i0.02 

0.42i0.01 

0.62i0.02 

0.27i0.02 

0.49i0.03 

0.73i0.02 

0.51i0.03 

0.70i0.02 

0.86i0.05 

0.91i0.04 

1.06i0.06 

16% 
0.75 50% 

84% 

-0.11±0.04 

0.12±0.02 

0.34±0.02 

-0.25±0.02 

0.10±0.02 

0.34±0.03 

-0.11±0.02 

0.13±0.02 

0.33±0.01 

0.11±0.02 

0.36±0.01 

0.56±0.01 

0.57i0.02 

0.74i0.03 

0.93i0.04 

0.04i0.01 

0.26i0.00 

0.47i0.01 

0.17i0.01 

0.39i0.01 

0.58i0.01 

0.27i0.01 

0.49i0.01 

0.66i0.01 

0.39i0.01 

0.61i0.02 

0.78i0.01 

0.64i0.04 

0.82i0.03 

0.98i0.02 

16% 
1.25 50% 

84% 


-0.21T0.02 

0.06±0.03 

0.33±0.03 

-0.23±0.01 

-0.01±0.02 

0.27±0.03 

-0.07±0.02 

0.16±0.02 

0.44±0.03 

0.32i0.04 

0.49i0.03 

0.75i0.03 

-0.05i0.01 

0.19i0.00 

0.41i0.01 

0.10i0.01 

0.32i0.00 

0.51i0.01 

0.23i0.01 

0.42i0.00 

0.60i0.01 

0.36i0.01 

0.53i0.01 

0.69i0.01 

0.51i0.01 

0.67i0.02 

0.83i0.01 

16% 
1.75 50% 

84% 


-0.12±0.06 

0.12±0.04 

0.36±0.04 

-0.36±0.02 

-0.05±0.04 

0.27±0.04 

-0.14±0.02 

0.08±0.02 

0.42±0.03 

0.11i0.06 

0.37i0.03 

0.67i0.03 

-0.09i0.01 

0.15i0.00 

0.36i0.01 

0.02i0.01 

0.24i0.01 

0.46i0.01 

0.16i0.01 

0.38i0.01 

0.55i0.01 

0.24i0.02 

0.47i0.01 

0.64i0.01 

0.41i0.03 

0.58i0.02 

0.75i0.02 

16% 
2.25 50% 

84% 



-0.53±0.09 

-0.12±0.05 

0.25±0.03 

-0.34±0.03 

-0.02±0.03 

0.41±0.06 

0.02i0.05 

0.25i0.05 

0.72i0.12 


-0.06i0.01 

0.17i0.01 

0.38i0.01 

0.06i0.02 

0.29i0.01 

0.49i0.01 

0.16i0.02 

0.39i0.01 

0.56i0.01 

0.29i0.04 

0.52i0.02 

0.70i0.02 

16% 
2.75 50% 

84% 




-0.37±0.05 
-0. 01±0. 04 
0.40i0.06 

-0.08i0.ll 

0.29i0.09 

0.62i0.10 



0. OliO. 02 
0.27i0.01 
0.48i0.02 

0.08i0.05 

0.34i0.01 

0.54i0.02 

0.24i0.03 

0.43i0.02 

0.63i0.06 


Table A2 

Logarithmic size distributions (16% - 84% range) as a function of rest-frame V-band luminosity and redshift. 


z 

Early- Type Galaxies 

Late-Type Galaxies 

L* = io 9 - 25 

9.75 

10.25 

10.75 

11.25 

9.25 

9.75 

10.25 

10.75 

11.25 

16% 
0.25 50% 

84% 

0.04i0.05 

0.27i0.02 

0.44i0.04 

0.06i0.03 

0.28i0.02 

0.51i0.04 

0.21i0.05 

0.45i0.02 

0.72i0.05 

0.61i0.07 

0.72i0.03 

0.99i0.06 


0.16i0.03 

0.42i0.02 

0.64i0.02 

0.31i0.02 

0.57i0.01 

0.76i0.01 

0.42i0.02 

0.69i0.02 

0.87i0.02 

0.68i0.06 

0.92i0.03 

1.07i0.04 


16% 
0.75 50% 

84% 

-0. OliO. 05 
0.24i0.02 
0.41i0.02 

-0.08i0.03 

0.21i0.02 

0.46i0.02 

0.05i0.02 

0.27i0.02 

0.47i0.02 

0.31i0.01 

0.54i0.02 

0.81i0.02 

0.54i0.05 

0.81i0.06 

1.03i0.06 

0.06i0.02 

0.33i0.01 

0.57i0.02 

0.21i0.01 

0.46i0.01 

0.67i0.00 

0.37i0.01 

0.60i0.01 

0.79i0.01 

0.54i0.01 

0.75i0.01 

0.93i0.02 

0.73i0.10 

0.93i0.08 

1.12i0.08 

16% 
1.25 50% 

84% 

-0.10i0.12 

0.19i0.04 

0.45i0.11 

-0.12i0.04 

0.23i0.03 

0.51i0.03 

-0.14i0.02 

0.10i0.01 

0.40i0.02 

0.07i0.02 

0.32i0.01 

0.58i0.03 

0.29i0.03 

0.56i0.03 

0.83i0.04 

0.03i0.02 

0.32i0.01 

0.66i0.04 

O.lliO.Ol 

0.37i0.01 

0.60i0.01 

0.25i0.01 

0.49i0.01 

0.70i0.00 

0.43i0.01 

0.64i0.01 

0.82i0.01 

0.48i0.05 

0.75i0.02 

0.96i0.03 

16% 
1.75 50% 

84% 


-0.04i0.08 

0.27i0.03 

0.53i0.05 

-0.19i0.03 

0.09i0.02 

0.39i0.03 

-0.11i0.03 

0.15i0.02 

0.50i0.04 

0.20i0.03 

0.37i0.02 

0.71i0.04 

0.12i0.03 

0.43i0.04 

0.82i0.05 

0.05i0.01 

0.32i0.01 

0.58i0.01 

0.15i0.01 

0.40i0.01 

0.62i0.01 

0.32i0.01 

0.56i0.01 

0.75i0.01 

0.37i0.08 

0.70i0.03 

0.87i0.01 

16% 
2.25 50% 

84% 



-0.26i0.ll 

0.08i0.04 

0.35i0.08 

-0.26i0.03 

0.07i0.04 

0.50i0.05 

0.00i0.05 

0.24i0.03 

0.49i0.05 


0. OliO. 04 
0.30i0.03 
0.68i0.06 

0.08i0.01 

0.34i0.01 

0.56i0.01 

0.20i0.01 

0.44i0.01 

0.65i0.01 

0.26i0.06 

0.59i0.02 

0.79i0.03 

16% 
2.75 50% 

84% 




-0.27i0.03 

0.03i0.05 

0.42i0.07 

0. OliO. 05 
0.34i0.09 
0.73i0.08 



0.09i0.01 

0.33i0.02 

0.62i0.04 

0.18i0.01 

0.43i0.01 

0.63i0.01 

0.26i0.07 

0.54i0.02 

0.75i0.02 
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Table A3 

Logarithmic size distributions (16% - 84% range) for the full population (early- and late- type galaxies combined) as a function of mass 

and redshift, and rest-frame V-band luminosity and redshift. 


z 

Early+Late Type Galaxies 

M * = 10 9 - 25 

9.75 

10.25 

10.75 

11.25 

L * = 10 9 ' 25 

9.75 

10.25 

10.75 

11.25 

16% 
0.25 50% 

84% 

0.21±0.01 

0.46±0.01 

0.69±0.01 

0.28±0.02 

0.54±0.02 

0.77±0.02 

0.27±0.03 

0.54±0.03 

0.82±0.02 

0.49±0.04 

0.75±0.03 

0.99±0.02 


0.14±0.02 

0.37±0.02 

0.62±0.02 

0.26±0.02 

0.53±0.01 

0.74±0.01 

0.37±0.01 

0.62±0.02 

0.85±0.01 

0.62±0.04 

0.79±0.03 

1.05±0.03 


16% 
0.75 50% 

84% 

0.16±0.01 

0.41±0.01 

0.64±0.01 

0.25±0.01 

0.52±0.01 

0.74±0.01 

0.18±0.01 

0.52±0.01 

0.78±0.01 

0.35±0.02 

0.59±0.01 

0.84±0.01 

0.66±0.02 

0.85±0.01 

1.04±0.02 

0.05±0.02 

0.29±0.01 

0.54±0.02 

0.18±0.01 

0.44±0.01 

0.66±0.00 

0.28±0.01 

0.54±0.01 

0.76±0.01 

0.40±0.02 

0.68±0.01 

0.90±0.01 

0.62±0.06 

0.83±0.04 

1.04±0.04 

16% 
1.25 50% 

84% 


0.21±0.01 

0.47±0.01 

0.68±0.01 

0.15±0.02 

0.52±0.01 

0.74±0.01 

0.23±0.02 

0.57±0.01 

0.80±0.01 

0.49±0.03 

0.74±0.02 

0.94±0.02 

0.02±0.02 

0.30±0.02 

0.64±0.03 

0 . 10 ± 0.01 

0.36±0.01 

0.60±0.01 

0.19±0.01 

0.47±0.01 

0.69±0.01 

0.29±0.01 

0.58±0.01 

0.79±0.01 

0.36±0.04 

0.67±0.04 

0.91±0.03 

16% 
1.75 50% 

84% 


0.15±0.01 

0.42±0.01 

0.65±0.01 

0.12±0.02 

0.48±0.01 

0.69±0.01 

0.09±0.01 

0.48±0.01 

0.74±0.01 

0.34±0.02 

0.64±0.03 

0.83±0.02 


0.05±0.01 

0.32±0.01 

0.58±0.01 

0.12±0.01 

0.39±0.01 

0.61±0.00 

0.16±0.01 

0.51±0.01 

0.73±0.01 

0.22±0.02 

0.51±0.04 

0.83±0.02 

16% 
2.25 50% 

84% 



0.10±0.02 

0.41±0.01 

0.63±0.01 

0.03±0.03 

0.45±0.02 

0.68±0.01 

0.28±0.03 

0.59±0.03 

0.83±0.02 



0.07±0.01 

0.33±0.01 

0.55±0.01 

0.15±0.01 

0.43±0.01 

0.64±0.01 

0.10±0.03 

0.45±0.03 

0.74±0.03 

16% 
2.75 50% 

84% 




0.01±0.03 

0.43±0.01 

0.65±0.02 

0.27±0.06 

0.52±0.03 

0.75±0.04 




0.11±0.02 

0.41±0.01 

0.62±0.01 

0.12±0.06 

0.52±0.03 

0.74±0.02 


